arXiv: 1507.06616v3 [cs.DS] 5 May 2016 


Noname manuscript No. 

(will be inserted by the editor) 


Robust Monotone Submodular Function 
Maximization 


James B. Orlin • Andreas S. Schulz • 
Raj an Udwani 


Abstract We consider a robust formulation, introduced by Krause et al. 
(2008), of the classic cardinality constrained monotone submodular function 
maximization problem, and give the first constant factor approximation re¬ 
sults. The robustness considered is w.r.t. adversarial removal of up to r ele¬ 
ments from the chosen set. For the fundamental case of r = 1, we give a de¬ 
terministic (1 — 1/e) — f2(l/m) approximation, which makes 0(n m+1 ) queries. 
In the process, we develop a deterministic (1 — 1/e) — 12(1 /m) approximate 
greedy algorithm for bi-objective maximization of (two) monotone submod¬ 
ular functions. Generalizing the ideas and using a result from Chekuri et al. 
(2010), we show a randomized (1 — 1/e) — e approximation for constant r and 
e < -pJ—-, making 0{n ) queries. For r = o{\fk ), we give a fast and practical 
0.387 algorithm. Finally, we also give a result for a more general setting. 


1 Introduction 

A set function / : 2^ -> R on the ground set N is called submodular if, 

f(A + a) — f(A) < f(B + a) — f(B) for all B C A C N and a G N\A. 

The function is monotone if f(B) < f(A) for all B C A. We also impose 
/(0) = 0, which combined with monotonicity implies non-negativity. Opti¬ 
mization problems with submodular objective functions have received a lot 
of interest due to several applications where instances of these problems arise 
naturally. However, unlike the (unconstrained) minimization of submodular 
functions, for which polytime algorithms exist mm even the simplest max¬ 
imization versions are NP-hard HDHUm In fact, they encompass several 
fundamental hard problems, such as max-cut, max-fc-coverage, max-dicut and 
variations of max-SAT and max-welfare. 

A long line of beautiful work has culminated in fast and tight approxi¬ 
mation algorithms for many settings of the problem. As an example, for un¬ 
constrained maximization of non-monotone submodular functions, Feige et al. 


{jorlin,schulz,rudwani(9!mit.edu} 






2 


James B. Orlin et al. 


in Ha, provided an algorithm with approximation ratio of 0.4 and showed 
an inapproximability threshold of 1/2 in the value-oracle model. Extensions 
by Gharan and Vondrak [T5] and subsequently by Feldman et al. [14} led to 
further improvement of the guarantee (roughly 0.41 and 0.42, respectively). 
Finally, Buchbinder et al. in [7] gave a tight randomized 1/2 approximation 
algorithm, and this was recently derandomized [8]. 

Here we are interested in the problem of maximizing a monotone submodu- 

lar function subject to a cardinality constraint, written as: PI := max f (A) 

ACN,\A\<k ^ 

The problem has been well studied and instances of PI arise in several impor¬ 
tant applications, two of them being: 

Sensor Placement [ THI 1281122] : Given a large number of locations N, we 
would like to place up to k sensors at certain locations so as to maximize the 
coverage. Many commonly used coverage functions measure the cumulative 
information gathered in some form, and are thus monotone (more sensors is 
better) and submodular (decreasing marginal benefit of adding a new sensor). 

However, as highlighted in [21] , it is important to ask what happens if some 
sensors were to fail. Will the remaining sensors have good coverage regardless 
of which sensors failed, or is a small crucial subset responsible for most of the 
coverage? 

Feature Selection [T6l2lll23tt28j : In many machine learning models, adding a 
new feature to an existing set of features always improves the modeling power 
(monotonicity) and the marginal benefit of adding a new feature decreases as 
we consider larger sets (submodularity). Given a large set of features, we would 
like to select a small subset such that, we reduce the problem dimensionality 
while retaining most of the information. 

However, as discussed in [16M21] , in situations where the nature of un¬ 
derlying data is uncertain, leading to non-stationary feature distributions, it 
is important to not have too much dependence on a few features. Taking a 
concrete example from nu, in document classification, features may take not 
standard values due to small sample effects or in fact, the test and training 
data may come from different distributions. In other cases, a feature may even 
be deleted at some point, due to input sensors failures for example. Thus, 
similar questions arise here too and we would like to have an ‘evenly spread’ 
dependence on the set of chosen features. With such issues in mind, consider 
the following robust variant of the problem, introduced in EH, 

P2 := max min f(A — Z). 

A<ZN,\A\<k ZCA,\Z\<t 

Note that the parameter r controls the degree of robustness of the chosen 
set since the larger r is, the larger the size of subset Z that can be adversarially 
removed from the chosen set A. For r = 0, P2 reduces to the PI. Since 
this formulation optimizes the worst case scenario, a natural variation is to 
optimize the average case failure scenario m- However, this is not suitable for 
some applications. For instance, we may have no prior on the failure/deletion 
mechanism and furthermore, in critical applications, such as sensor placement 
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for outbreak detection sum, we want protection against the worst case. This 
form of worst case analysis has been of great interest in operations research 
and beyond, under the umbrella of robust optimization (e.g., EH3). The idea 
is to formulate the uncertainty in model parameters through a deterministic 
uncertainty set. While much work in this area assumes that the uncertainty 
set is a connected, if not convex set, the uncertainty set in P2, when t = 1 for 
instance, is the disconnected set of canonical unit vectors e* £ (1 at entry 

i, 0 otherwise). 


Previous work on PI and P 2. The first rigorous analysis of PI was by Nemhauser 
et al. urns] in the late 70’s, where they showed that the greedy algorithm 
gives a guarantee of (1 — 1/e) and that this is best possible in the value-oracle 
model. Later, Feige m showed that this is also the best possible under stan¬ 
dard complexity assumptions (through the special case of Max-fc-cover). On 
the algorithmic side, Badanidiyuru and Vondrak [li recently gave a faster al¬ 
gorithm for PI that improved the quadratic query complexity of the classical 
greedy algorithm to nearly linear complexity, by trading off on the approxima¬ 
tion guarantee. However, the optimality (w.r.t. approximation guarantee) of 
the greedy algorithm is rather coincidental, and for many complex settings of 
the problem (monotone or not), the greedy algorithm tends to be sub-optimal 
(there are exceptions, like [27]). An approach first explored by Chekuri et 
al. [8], that has been very effective, is to perform optimization on the multi¬ 
linear relaxation of the submodular function, followed by clever rounding to 
get a solution to the original problem. Based on this framework, tremendous 
progress has been made over the last decade for both monotone and non¬ 
monotone versions with various kinds of constraints [81l21 IT311221 - f?T] . In fact, a 
general framework for establishing hardness of many of these variants eisii, 
also relies intricately on properties of this relaxation. 

Moving on to P2, as we will see, the well known greedy algorithm and also 
the above mentioned continuous greedy approach for r = 0, can be arbitrarily 
bad even for r = 1. In fact, many natural approaches do not have a constant 
factor guarantee for r > 1. The paper by Krause et al. m , which formally 
introduced the problem, actually gives a bi-criterion approximation to the 
much more general and inapproximable problem: max min 

N , | A | ^ k i£{l,2,...,m} 

where /)(.) is monotone submodular for every i. Their algorithm, which is 
based on the greedy algorithm for PI, when specialized to P2, guarantees 
optimality by allowing sets up to size k( 1 + 6*(log(rfc log n))) and has runtime 
exponential in r. To the best of our knowledge, no stronger/constant factor 
approximation results were known for P2 prior to our work. 

Our contributions: We work in the value oracle model and give constant factor 
guarantees for P2 with combinatorial, ‘greedy like’ algorithms. Our initial 
focus is on a restricted case, where we construct relatively simple algorithms 
and get insights that generalize. For the special case, we give an asymptotically 
(1 — 1/e) algorithm for r = o(k). 
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In the non-restricted setting, we first focus on generalizing results for r = 1, 
for which we propose a fast and practical 0.5547 approximation and later an 
asymptotically (1 — 1/e) — 12(1 /m) algorithm. This relies on developing a new 
(1 — 1/e) — 12(l/m) approximate greedy algorithm for the bi-objective maxi¬ 
mization of monotone submodular functions subject to cardinality constraint, 
of which the problem of maximizing the minimum of two arbitrary monotone 
submodular functions is a special case. We conjecture that the greedy algo¬ 
rithm can be generalized to work for multi-objective maximization of a fixed 
number of monotone submodular functions. There has been previous work on 
the multi-objective problem for a constant number of monotone submodular 
functions and a randomized (1 — 1/e) — e algorithm for (a much more general 
version of) this problem was given in [5]. It uses the continuous greedy algo¬ 
rithm from [31| . along with an innovative dependent rounding scheme called 
swap rounding , and we use this algorithm as a subroutine to get a randomized 
(1 — 1/e) — e approximation for the robust problem when r is constant. For 
r = o(Vk ), we give a fast and asymptotically 0.387 algorithm. 

In the more general case, where we wish to find a robust set A in an 
independence system, we extend some of the ideas from the cardinality case 
into an enumerative procedure that yields an a/(r + 1) approximation using 
an a approximation algorithm for r = 0 as a subroutine. However, the runtime 
scales as n T+1 . 

The outline for the rest of the paper is as follows: In Section 2, we introduce 
some notation and see how several natural ideas fail to give any approximation 
guarantees. In Section 3, we start with a special case and slowly build up to 
an algorithm with asymptotic guarantee (1 — 1/e) — e for fixed r, covering the 
other results in the process. Finally, in Section 4, we extend some of the ideas 
to more general constraints. Section 5 concludes with some open questions. 


2 Preliminaries 

2.1 Definitions 

We denote an instance of P 2 on ground set N with cardinality constraint 
parameter k and robustness parameter r by ( k,N,r). Subsequently, we use 
OPT(k,N,r) to denote an optimal set for the instance ( k,N,r ). For any 
given set A , we call a subset Z a minimizer if f(A — Z) = min f(A — B). 

Also, let Z(A) be the set of minimizers of A. When r = 1, we often use the 
letter z for minimizers. In fact, we generally refer to singleton sets without 
braces {} and use + and U interchangeably. Next, consider the set function 
g T (A ) = f(A — Z). We simply use g(.), when r is clear from context. Also, 
define the marginal increase in value due to a set X, when added to the set A 
as f(X\A) = f{A U X) - f{A). Similarly, g(X\A) = g(A U X) - g(A). 

Let (3(r],a ) = for g G [0,1], a > 0. Note that /3(0,1) = (1 — 1/e). 
This function appears naturally in our analysis and will be useful for expressing 
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approximation guarantees of the algorithms. Next, recall the widely popular 
greedy algorithm for PI: 

Algorithm 1 Greedy Algorithm 
1: Initialize A = 0 

2: while |A| < k do A = A + argmax/(a;|A) 

3: Output: A xeN-A 

Let k = n in the above and denote the i-th element added by a^. Us¬ 
ing this we index the elements in A in the order they were added, so A = 
{ai ,a 2 ,...,o n }. 

Also, recall the following theorem: 

Lemma 1 (Nemhauser, Wolsey |24LI25| f For all a > 0, the greedy algo¬ 
rithm terminated after ak steps is /3( 0, a) approximate for instances (fc, A, 0). 

In addition, the following lemma, which compares the optimal value of 
(k, A, t) with ( k — r, A, 0) will be very useful: 

Lemma 2 For instances (k,N,r), we have: 

g T (OPT(k, A, r)) < f(OPT(k -t,N-X, 0)) < f(OPT(k - r, A, 0)), 
for all X £ A, |A'| < r. 

Proof We focus on the first inequality since the second follows by defini¬ 
tion. Let x = | A fl OPT{k, A, r)| < r, then note that g T (OPT(k,N,r)) < 
g r - x (OPT(k, N,t) — A), since the RHS represents the value of some sub¬ 
set of OPT(k, N,t) of size k — t, which is > the LHS by definition. Now, 
g T - x (OPT(k, N,t) — X) < g T _ x (OPT(k — x, A — A, r — a:)) by definition. 
Finally, note that g T - x (OPT(k — x, A — A,r — x)) < f{OPT(k — r, A — X, 0)) 
since the LHS represents the value of a set of size k — x — (r — x) = k — r that 
does not include any element in A', giving us the desired. □ 

Finally, it is reasonable to expect that we cannot approximate P 2 better than 
PI (which is approximate up to a factor of /3(0,1)) and this is formalized 
below. 

Lemma 3 There exists no polytime algorithm with approximation ratio greater 
than (1 — 1/e) for P 2 unless P = NP. For the value oracle model, we have 
the same threshold, but for algorithms that make only a polynomial number of 
queries. 

Proof We will give a strict reduction from the classical problem PI (for which 
the above hardness result holds CM) to the robust problem P2. Consider an 
instance of PI, denoted by (k, A, 0). We intend to reduce this to an instance 
of P2 on an augmented ground set A U A i.e. (k + t, A U A', r). 

The set A = {aq, • • • , x T } is such that f(xi ) = (k + l)/(ai) and f(xi\S) = 
f(xi) for every i and S C A U X not containing Xi. Using this, we will show 
that g{OPT{k + r, A U A, r)) = f(OPT{k, A, 0)). 








6 


James B. Orlin et al. 


First, note that for arbitrary set S = Sn U Sx, such that |5| = k + r 
and Sx = S (1 X, we have that every minimizer contains Sx- This follows by 
definition of X, since for any two subsets P,Q of S with |P| = \Q\ = k and 
P disjoint with X but Q fl X ^ 0, we have by monotonicity f(Q) > f{xi) = 
(k + l)f(ai) > kf{a i) and by submodularity kf(a i) > f(P). This implies 
that X is the minimizer of OPT(k, AT, 0) U X and hence f(OPT(k,N, 0)) < 
g{OPT(k + r, N U X, r)) 

For the other direction, consider the set OPT(k + t, N U X,t) and define, 
M = OPT(k + r, N U X, r) n X 

Next, observe that carving out an arbitrary set B of size r— \M\ from OPT(k+ 
r, N U X, t) — M will give us the set 

C = OPT(k + t,NuX,t)-M-B 

of size k -\~ t — (\M\ + r — \M\) = k. Also note that by design, C C TV and 
hence f(C) < f(OPT(k,N, 0)), but by definition, we have that g(OPT(k + 
t.N UX, t)) < f(C). This gives us the other direction and we have q(OPT(k + 
t,N U X, t))~= f (OPT(k,^N, 0)). 

To complete the reduction we need to show how to obtain an a-approximate 
solution to (. k , N, 0) given an a-approximate solution to (k + r, TV U X, r). Let 
S = Sn U Sx be such a solution i.e. a set of size k + r with Sx = S n X, such 
that g(S) > ag(OPT(k + t,N U A, r)). Now consider an arbitrary subset S' N 
of Sn of size r —|5x|- Observe that IS'jv — 'S'atI = 151 — |5xJ — {t— |5x|) = k and 
further /(Sn — S' N ) > g(S) > ag(OPT(k + r, N U X , r)) = af(OPT(k,N, 0)), 
by definition. Hence the set Sn — S' N C N is an a-approximate solution to 
(k, N, 0) and we are done. □ 


2.2 Negative Results 

The example below demonstrates why the greedy algorithm that does well 
for instances of PI, fails for P2. However, the weakness will also indicate a 
property which will guide us towards better guarantees later. 

Example: Consider a ground set N of size 2k such that f(a\) = 1, /(a/) = 0, 
V2 < i < k and f(aj) = Vj > k + 1. Also, for all j > k + 1, let f(aj |A) = ^ 
if X fl {ai,a,j} = 0 and 0 otherwise. Consider the set S = {afc+i,--- ,a, 2 k} 
and let the set picked by the greedy algorithm (with arbitrary tie-breaking) 
be A = {m, • • • , a*,}. Then we have that f(A — a\) = 0 and f(S — aj) = 1 — ^ 
for every aj G S. The insight here is that greedy may select a set where only 
the first few elements contribute most of the value in the set, which makes it 
non-robust. However, as we discuss more formally later, such a concentration 
of value implies that only the first two elements, { 01 , 02 }, are critical and 
protecting against removal of those suffices for best possible guarantees. 

In fact, many natural variations fail to give an approximation ratio better 
than 1 /(k— 1). Indeed, a guarantee of this order, i.e. l/(k — r), is achievable 
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for any r by the following naive algorithm: Pick the k largest value elements. 
It is also important to examine if the function g is super/sub-modular, since 
that would make existing techniques useful. It turns out not However, it is 
monotonic. Despite this, it is interesting to examine a natural variant of the 
greedy algorithm, where we greedily pick w.r.t g, but that variant can also be 
arbitrarily bad if we pick just one element at each iteration. 


3 Main Results 

Before we start, note that the focus of these results is on asymptotic perfor¬ 
mance guarantee (for large k). In some cases, the results can be improved for 
small k but we generally ignore these details. 

Additionally, in every algorithm that uses the greedy algorithm (Algorithm 
[l|) as a subroutine, especially the fast and practical algorithms [4| [2] and [3 we 
can replace the greedy addition rule of adding x = argmax^ggj f(x\S 2 ) for 
some Si , S' 2 ■ by the more efficient thresholding rule in [T], where, given a 
threshold w, we add a new element x € Si if /(ajS^) > w. This improves the 
query/run time to 0(j log j), at the cost of a factor of (1 — e) in the guarantee. 


3.1 Special case of “copies” 

We first consider a special case, which will serve two purposes. First, it will 
simplify things and the insights gained for this case will generalize well. Sec¬ 
ondly, since this case may arise in practical scenarios, it is worthwhile to discuss 
the special algorithms as they are much simpler than the general algorithms 
discussed later. 

Given an element a: € A, we call another element x' a copy of element x 

if, 

f(x r ) = f{x) and f(x’\x) = 0 

. This implies f(x\x') = f({x, x'}) - f(x') = f(x) + f{x'\x) - /( x') = 0. 
In fact, we have more generally, f(x'\A) = f(x\A) for every A C N, since 
f(A + {x, x'}) = f(A + x) = f{A + x'). This is a useful case for robust sensor 
placement, if we were allowed to place/replicate multiple sensors at certain 
locations that are critical for coverage. For the rest of this sub-section, assume 
that each element in N has r copies. In the next section we discuss algorithms 
for this special case when r = 1. This will help build a foundation, even though 
the results therein are superseded by the result for r = o(k). 

3.1.1 Algorithms for r = 1 in presence of “copies” 

Let of denote the copy of element a^. As indicated previously, we would like 
to make our set robust to removal of critical elements. In the presence of 
copies, adding a copy of these elements achieves that. So as a first step, let’s 
construct a set that includes a copy of each element, and so is unaffected by 
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single element removal. One way to do this is to run the greedy algorithm 
for k/2 iterations and then add a copy of each of the k/2 elements. Then, 
it follows from Lemma Q] that g(S) = f(S) > 0(0, ^^)f(OPT(k — 1,N, 0)) 
and then from Lemma [2]we have, g(S) > (1- \=) g(OPT(k, N, 1)), where we 

use the fact that 0(0, jrf) > 0(0, 0.5) = (1 — -^). Hence, we have « 0.393- 
approximation and the bound is tight. We can certainly improve upon this, 
one way to do better is to think about whether we really need to copy all k/2 
elements. It turns out that just copying { 01 , 02 } is enough. Intuitively, if the 
greedy set has value nicely spread out, we could get away without copying 
anything but nevertheless, in such a case copying just two elements does not 
affect the value much. Otherwise, as in the example from Section 2, if greedy 
concentrates its value on the first few elements, then copying them is enough. 

Before we state and prove this formally, consider the below corollary: 
Corollary 4 Let A be the final set obtained by running the greedy algorithm 
for l steps on an initial set S. Then we have, 

f(A-S\S) > p(0, l -)f(OPT(k, N, 0)|S) > /?(0, (f(OPT(k,N,Oj)-f(S)) 

Proof Follows from Lemma [I] since /(.|5) is monotone submodular for any 
fixed set S and union of greedy on f(.\S) with S is the same as doing greedy 
on /(.) starting with S. □ 

Now we state and prove a simple yet key lemma, which will allow us to quantify 
how the guarantee of greedy algorithm w.r.t. f(OPT(k,N, 0)) improves over 
(1 — 1/e), as a function of how concentrated the value of the greedy set is on 
the first few elements. 


Lemma 5 Starting with initial set S, run l iterations of the greedy algorithm 
and let A be the output (so \A\ = l + 151}. Then given f(S) > cf(A) for some 
c < 1 (high concentration of value on S implies large c), we have, 

f(A) > p(c, t)f(OPT(k,N, 0)), for all k > 1 

In a typical application of this lemma, we will have S be the first s = |Sj 
elements of the greedy algorithm on 0 and c = sg for some g < 1/s. 


Proof Denote OPT(k , N, 0) by OPT. With the sets A, S as defined above, let 
S = f(Qp T ) , which implies f(S) > cdf(OPT) (by assumption). 

Now from Corollary [4] we have, f(A) = f(S) + f(A — 5|S) > f(S) + 
m i)(f(OPT) - f(S)) and thus, 


SfiOPT) > (1 - >3(0, {)) x c6f(OPT) + 0(0, {)f(OPT ) 
6(l- C (l-0(O,{)))>0(O,i) 


<5(1 - c-iyr) > 


e l ' k -l 

e l/k 


f(A)>0{c,i)f(OPT(k,N,O)) 


[substitution] 
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□ 


Now, consider the algorithm that copies the first two elements and thus out¬ 
puts: {ai,a' 1 ,a 2 ,a' 2 ,a 3 ,... ,ak- 2 }- Call this the 2-Copy algorithm. Using the 
above lemma, we show that this algorithm gives us the best possible guarantee 
asymptotically, in presence of copies, aligning with the intuitive argument we 
presented earlier. 

Theorem 6 For the case with copies, 2-Copy is 0(0, tEj) fc ^° c '> (1 — 1/e) 
approximate. 


Proof First, denote the output as A = {ai, a}, ti 2 , o! 2 , < 23 ,..., 0 ^- 2 } and notice 
that f(A) = /({ai, 02 ,..., ak- 2 })- As a warm-up, using Lemma [2 we get, 

/(A) > 0(0, tll)f{OPT{k - 1, N, 0)) > 13(0, )g(OPT(k , N, 1)) (1) 

, where the second inequality follows from Lemma [2j 

Let z be a minimizer of A. We can assume that z {oi, a' 1: 02 , 02 } since all 
of these have 0 marginal. Now let f(z\A — z) = r)f(A). We have due to greedy 
additions and submodularity, /(a 3 |{ai, 02 }) > f(z\{a±, 02 }) > f(z\A — z) and 
similarly,/({oi, 02 }) > 2f(z\A—z). This implies that /({ai. 02 , 03 }) > 3r]f(A), 
which relates the value removed by a minimizer z to the value concentrated 
on the first 3 elements {oi, 02 , 03 }. Higher the value removed, higher the con¬ 
centration and closer the value of f(A) to f(OPT(k— 1, N, 0)). More formally, 
with S = {ai, a[, 02 , a' 2 , 03 } and c = 3?y (77 < 1/3), we have from Lemma[5j 

f(A) > 0 ( 377 , f(OPT(k - 1, N , 0)) > 0 ( 377 , tj—^)g(OPT(k, N, 1)) 


. Which implies that g(A) = (1 - r])f(A) > (1 - r])(3(3r], )g(OPT(k , N, 1)) 
Now, we show that the factor (1 — 77 ) 0 ( 377 , ) is minimized for 77 = 0, which 

k — 5 

finishes the proof. For ease of notation, let e denote e k - 1 , then we have, 


( 1 - 


k- 5 
k- 1 




> 


- 3t? 

3 - e' 


- 1)1 + 


3 — e' 
e' — 377 


) 


5 (e '~ i) ( i+ ^ 7 £ ‘) =9-1/0 =77(0. Je 4 ) 


As an example, for k > 55 the value of the ratio is > 0.6. Additionally, we can 
use different bounds for small k to get better guarantees in that regime, since 
for * £ { 07 , 02 }, f(A -z)> /({oi, o 2 }) > (1 - (1 - ) 2 )g(OPT(k , N, 1)). 
□ 


The result also implies that for large k, if the output of Algorithm[l](the greedy 
algorithm) has a minimizer ai with i > 3, then the greedy set is « (1 — 1/e) 
approximate for (k,N, 1). This is because, for such an instance we can assume 
that the set contains copies of oi, 02 and then apply Theorem [ 6 ] Moreover, if 
we copy the first i elements for * > 3, we get the same guarantee but with 
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worse asymptotics, so copying more than first two does not result in a gain, 
On the other hand, copying just one element, oi, gives a tight guarantee of 


0.5. 


Since (1 — 1/e) is the best possible guarantee achievable asymptotically, we 
now shift focus to case of r > f but <C k, and generalize the above ideas to 
get an asymptotically (1 — 1 /e) approximate algorithm in presence of copies. 

3.1.2 (1 — 1/e) Algorithms for r = o{k) in the presence of “copies ” 

Assume we have r copies available for each a,j G N. As we did for r = 1, we 
would like to determine a small critical set of elements, copy them and then add 
the rest of the elements greedily to get a set of size k. In order to understand 
how large the critical set should be, recall that in the proof of Theorem [G] 
we relied on the fact that /({a 1 , 02 }) is at least twice as much as the value 
removed by the minimizer, and then we could use Lemma [5] to get the desired 
ratio. To get a similar concentration result on the first few elements for larger 
t, we start with an initial set of size 2 r and in particular, we can start with 
A 2t = {cii, 02 ,..., <X 2 t}- Additionally, similar to the 2-Copy algorithm, we also 
want the set to be unaffected by removal of up to r elements from A 2t ■ We do 
this by adding r copies of each element in A 2r . More concretely, consider the 
algorithm that greedily picks the set A k _ 2r 2 = {ai, a 2 , ..., a k _ 2r 2 }, and then 
copies each element in A 2t = {ai, a 2 , ..., a 2 T } C A k _ 2r 2 , r times, resulting in 
r + 1 instances of each element in A 2t (counting the element itself). Denote 
the set of copies by C(A 2t ) then we have \C(A 2t )\ = 2 r 2 . Thus, the algorithm 
outputs the set 


A — A 2t U C(A 2t ) U {a2r+l) • ■ • > Ofe-2r 2 }. 


Observe that this is the same as the 2-Copy algorithm when r = 1. 

We next show that this algorithm is /3(0, k ~ 2 k _f 3r ) (1 — 1/e) ap¬ 

proximate. 

Proof We can assume that Z D (A 2t U C(A 2t )) = 0 or alternatively Z C 
A k _ 2t 2 — A 2t , since there are r + 1 copies of every element (counting itself) 
and Z cannot remove all. 

Next, index elements in Z in the order they were added to A. Then we 
have a mapping n : {1 ,..., r} —> {2r + 1,..., k — 2r 2 }, such that tt( i) > n(j) 
for i > j and is the 1-th element in Z. Then with A,; = {ai,..., a^} for 
all 1, we have by submodularity, the following set of inequalities, 


y(^7r(i) |“^-7r(i) — l) f{^Tr(i)\A "(^ 77 ( 1 ), * • * , ^k—2 r 2 }) 

A f (^ 77 ( 7 ) | A > * ■ * ; ^ 7 r(r) }) W 


r 



(2) 


where the RHS in (J2J is by definition. 
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Next, for arbitrary i £ {1,..., r} = [r] and j £ {1,..., 2r} = [2r], note 
that 7r(i) > 2r, then due to greedy iterations we have that /(a,r(i)|^4 ff (i)_i) < 
So consider any injective mapping from i £ [r] to 2 distinct el¬ 
ements i \, i 2 £ [2t], for instance = *, *2 = * + r. We rewrite the previous 
inequality as, 

2/(^7r(i) |^4-7r(*) — 1) — f {P'ii l^ii_ 1) /(Uj 2 —l) 

Summing over all i 1 along with d2|) above gives, 

2f(Z\A-Z)<f(A 2r ) (3) 

where the RHS is by injective nature of mapping and definition of /(.|.). 

In fact, we have 7r(«) > 2 r+i and thus, /(a^lA^^) < /(a 2T +i|^ 2 r+i-i)- 
From this, we have for the set A 3t — An = {a 2 T +i> • • ■, a 3 r} that, 

f(Z\A-Z)<f(A 3T -A 2T \A 2T ) (4) 

© and combined give us, 

f(A 3T )>3f(Z\A-Z) 

We have from Lemma[2]that f{OPT(k — r, N, 0)) > g(OPT{k, N, r)) and so 
using Lemma 0 with k — 1 replaced by k — r, S = A 3t U C(A 2t ), s = 3 and 
l = k — \S\ = k — 2r 2 — 3r gives us, 

f(A -Z)> 0(O, k ~^~ 3T )g(OPT(k, N , r)). 

□ 

Note that we compared the value g(OPT(k, TV, r)), which is the /(.) value 
of a set of size k — r, to the /(.) value of a set of size k — @(r 2 ) (since the @(r 2 ) 
elements added as copies do not contribute any real value). Note that * 

converges to 1 only for r = o(\fk) and it is this degradation that creates the 
threshold of o(\fk). 

However, it turns out that we don’t need to add r copies of each element 
in A 2t . Intuitively, the first few elements in A 2t are more important and for 
those we must add r copies, but the later elements are not as important and 
we can add fewer copies. In fact, we can geometrically decrease the number 
of copies we add from r to 1, over the course of the 2r elements, resulting 
in a total of Q(j logr) copies. The resulting approximation ratio converges to 
(1 — 1/e) for r = o(k). 

More concretely, consider the following algorithm, 

Algorithm 2 (1 — 1/e) Algorithm for copies when r = o[k) 

1: Initialize A = — 1 

2: while i < [log2r] do 

3: A = A U (|’t/ 2 4—1 "| copies for each of — ..., a 2 i+i_ 2 }); £ = £ + 1 

4: while \A\ < k do A = A + argmax/(fc| A) 

5: Output: A xeN-A 
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Essentially, we start with the set A. 2 r , add r copies each for { 01 , 02 }, r/2 
copies for each of { 03 ,..., ag}, t/4. copies for each of { 07 ,, 014 } and so on, 
finally adding the rest of the k — 2r(log 2r + 1) elements greedily. Notice that 
we could get the best possible guarantee with a minimizer oblivious algorithm 
i.e., the output is independent of the minimizer at any stage of the algorithm. 

Theorem 7 Algorithm [H is /3^0, fc ~ 2r d°s^ T + 3 / 2 ) k^oo^ ^ _ \j e ^ approxi¬ 
mate for t = o{k). 

Proof The basic outline of the analysis is similar to that of the previous one 
for r = o{Vk ), so we focus on the main differences. Note that if Z fl A 2 T = 0, 
then we have m and <0 as before and thus f(A 3r ) > 3(Z\A — Z). By applying 
Lemma [5] with l = k — 2r(log 2r + 1) this time, we get the desired. 

However, unlike the previous analysis, here Z(lA 2 T need not be empty since 
we have < r copies for many elements. Let Bj = {a 2 *- 1 , • ■ • , 02 i + 1 - 2 } he., the 
elements for which we add r/ 2'~ 1 copies in the algorithm and let Cj denote the 
set of copies of these elements. Now note that |Cj| > 2r and so Z will remove at 
most |Bi 1/2 — 1 from Bi since it’s pointless to remove an element from Bj unless 
all its copies can be removed as well. Now, |Bj| = 2 l ,\/i < [log2r] and hence, 
|-Bj| = 2(|Bj|/2 — 1). Suppose that Z C Bi U Cj for some i < [log2r], 
then by arguing as we did for (HJ), we have that /(Up}B.j) > 2f(Z\A — Z). 
This implies, /(A 2t ) > /(Up^Bj) + f(Z\ Up} Bj) > 3f{Z\A - Z) and by 
using Lemma [5j we are done. The more general case of arbitrary Z , where we 
let Zq = Z fl A 2 T and Z\ = A — Zq 7 is more technical but follows similarly to 
the above and we have that f{A 3r ) > 3f(Zi\A — Z\) + 3f(Z 0 \A — Z). □ 

3.2 Algorithms in the possible absence of “copies” 

The problem we must deal with now is that in general we won’t have copies. 
However, as we argued, if we can ‘modify’ the greedy set, so that it is robust 
to removal of the few critical elements, we get the best possible guarantee (for 
large k ). We see how to do this for r = 1 next. 

3.2.1 Algorithms for t = 1 in the possible absence of “copies’' 

We start by discussing how one could construct a greedy set that is robust 
to the removal of just a\. One approach would be to pick ai and then pick 
the rest of the elements greedily while ignoring a\, since that would greedily 
choose a copy of if it exists, and if not, it will allow the selection of elements 
which have small marginal on a\, but large value in absence of it (and so would 
not have been selected by the usual greedy algorithm that doesn’t ignore ai). 
Formally, 

Algorithm 3 0.387 Algorithm 

1: Initialize A = {ai} 

2: while |A| < k do A = A + argmax/^IA — {ai}) 

3: Output: A xeN-A 
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This simple algorithm is in fact, asymptotically 0.387 approximate (we 
omit the proof here). However, a possible issue with the algorithm is that 
it is oblivious to the minimizers of the set at any iteration. It ignores a\ 
throughout, even if ai stops being the minimizer after a few iterations. Thus, 
if we track the minimizer and stop ignoring ai when it is not a minimizer, 
we achieve a performance guarantee of 0.5 (which matches the guarantee ob¬ 
tained by copying just ai in presence of copies). Next, just as we saw for the 
case of copies, in order to get even better guarantees, we need to look at the 
set { 01 , 02 }. A direct generalization to a rule that ignores both ai and 02 , i.e. 
argmax/(x|A— {oi, 02 }), can be shown to have a performance bound less than 

xEN-A 

0.5. In fact, many natural addition rules result in upper bounds < 0.5. Algo¬ 
rithm [4] avoids looking at both elements simultaneously and instead ignores ai 
until its marginal becomes sufficiently small and then does the same for 02 , if 
required. 


Algorithm 4 0.5547— 0(1/k) Algorithm 

1: Initialize A = { 01 , 02 } 

Phase 1: 

2: while |A| < k and /(ai \A — ai) > AAI do A = A + argmax/(x|A 

xEN—A 

Phase 2: 

3: while |A| < k and f(a, 2 \A — 02 ) > AAl do A = A + argmax/(:c|A 

xEN—A 

Phase 3: 


4: while |A| < k do A 
5: Output:A 


A + argmax/(a;| A) 
x£N—A 


01) 

02) 


The algorithm is asymptotically 0.5547-approximate (as an example, guar¬ 
antee > 0.5 for k > 50) and note that it’s minimizer oblivious and only uses 
greedy as subroutine, which makes it fast and easy to implement (recall that 
greedy can be replaced by thresholding). 

Theorem 8 Algorithm^ is 0.5547 — f2(l/k) approximate. 

Proof Let A denote the output and A 0 C A denote { 01 , 02 }. Due to sub¬ 
modularity, there exists at most two distinct x £ A with f(x\A — x) > 
Additionally, for every x £ A 0 , we have that f(x\S) < /(ai) and f(x\S) < 
/(a 2 |ai) for arbitrary subset S of A containing A®. This implies that that 
2f(x\S) < f(A 0 ) < f(S ), which gives us that f(x\S) < ^ S 3 +3 ^ . 

Note that due to condition in Phase 1, the algorithm ignores ai even if it is 
not a minimizer, as long as its marginal is more than a third the value of the 
set at that iteration. At the end of Phase 1, if 02 has marginal more than third 
of the set value, then it is ignored until its contribution/marginal decreases. 
Phase 3 adds greedily (without ignoring any element added). As argued above, 
no element other than ai, 02 can have marginal more than a third of the set 
value at any iteration, so during Phase 2 we have that 02 is also a minimizer. 

We will now proceed by splitting into cases. Denote g(OPT(k, N, 1)) as 
OPT and recall from Lemma [21 OPT < f(OPT(k—l,N—ai,0)) for i £ {1,2}. 
Also, let the set of elements added to A 0 during Phase 1 be U = {ui,..., u p }, 
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similarly elements added during Phases 2 and 3 be V = {vx, ■ ■ ■ ,v q },W = 
{w\,... ,w r } respectively, with indexing in order of addition to the set. Fi¬ 
nally, let a p = , a q = f, a r = ^ , a = + a q + ay = , and 

assume k > 8. 

Case 1: Phase 2,3 do not occur i.e. p = k~2,q = r = 0. 

Since we have, 

f(A - ffll ) > /(a 2 ) + /?(0, |-|) (f(OPT(k -l,N- ai ,0))- f(a 2 )) 

> /?(0, 1 ^)f{OPT{k -1,N- 0l , 0)) 

where (a) follows from Lemma [3] and ( 6 ) from Lemma [2j This deals with the 
case z = a\. If z = u p , we have, 

f(A - u p ) > f(A - u p - ax) > ^—^)f(OPT(k - 1, N- oi,0)) 



where (c) is like J5|) above but with k — 2 replaced by k — 3 when using Lemma 
|TJ Finally, let z ^ {ai, 7 t p }, then due to the Phase 1 termination criteria, we 
have f(ai\A — a\ — u p ) > f(A — u p )/3 , which implies that, 

2/(ai|A-ai - Up) > f(A-ai-u p ) ( 6 ) 

Now letting ?? = > we have by submodularity f(z\A — z) < 

i)f{A — ax — u p ) and by definition f(A — a\ — u p — z) = (1 — p)f(A — a i — it p ). 
Using the above we get, 

(d) 

f{A - z) > max{/(ai), f(A -u p - z)} 

> max{/(’;| J 4 — ax — u p — z), f(A — ai — u p — z) + f(ax\A — ax — u p — z)} 

> ma x{r]f(A - ax - u p ), (1 - v)f(A - ax - u p ) + f(ax\A - ax - u p )} 

>max{j],(l-r]) + -}f{A-ax-u p ) (7) 

where (d) follows from monotonicity and the fact that ax £ A— z and A — u p — 
z C A — z. Now, from Lemma [5] with S = {a 2 ,ui}, l = p — 2, k replaced by 
k — 1, N by N — ax and c = 1, we have, f(A — ax — u p ) > j3(rj, a p )f(OPT(k — 
1, N — ax, 0)) > /3{rj, a p )OPT. Substituting this in dJJ) above we get, 

f(A - z) max{!j, | - r]}/3(r), a p )OPT 
>^Q,a p )0PT [77 = 3/4] 

> /3(0, a p )OPT = /3(o, ^)oPT 


(8) 
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Case 2: Phase 2 occurs, 3 doesn’t i.e. p + q = k — 2 and q > 0. 

As stated earlier, during Phase 2, 02 is the minimizer of A 0 Ut/U(U— v q ). We 
have g(A) = f(A-a 2 ) > f{A-v q -a 2 ) = f(ai+U)+f(V-v q \ai+U).Further, 
since the addition rule in Phase 2 ignores a 2 , we have from Lemma [Tj f(V — 

v q \ ai + U)> j3(0, a q )(OPT - /(01 + U)), and f(a x + U) > fi(o, OPT. 
Using this, 


g(A-v q ) 

m-z) 

OPT 


> /(«! + u) + P(0, a q )(OPT - f( ai + U)) 





P + q-1 
k-1 



■'j + (3(0, a q ) 

k - 3\ 
k-l) 


(9) 


Case 3: Phase 3 occurs i.e., r > 0. 

We consider two sub-cases, z € A — W and z G W. Suppose z G A — W. 
Due to f(z\A — W — z) < f(A — W)/ 3 we have, f(A — W) < § f(A —W — z). 
Also, f(W\A —W — z) > f(W\A -W)> (3(0, a r )(OPT - f(A - W)). Then 
using this along with the previous cases, 


f{A-z) 


f(A-z) 

OPT 


= f(A-W - z) + f(W\A -W-z ) 

> f(A — W — z) + (3(0, a r )(OPT - f(A - W)) 

> f(A — W — z) + (3( 0, a r )(OPT - 'p(A -W-z)) 

> (1 - |/?(0, a r ))f(A — W — z) + (3(0, a r )OPT 

> (1 — |/3(0, a r ))(3( 0, a p + a q )OPT + (3(0, a r )OPT [from ([EL©] 

> 0.5 - -2- + l(e~^ +a A + e _Qr ) 

2 e Q 2 V ; 

> 0.5 — —- + e~ a ^ 2 [for a r = a p + a q = a/2] 

= 0.5- — + 0.5547 


Now, suppose z G W, then note that for p+q > 6 we have either p > 3, and 
hence due to greedy additions f(z\A—z) < f({ui,u 2 ,U3})/3< f(A— W)/3, or 
q > 3, and again due to greedy additions f(z\A — z) < f(ai+UU{vi,v 2 })/3 < 
f(A-W)/ 3. 

If q > 0, then note that f(A—W) > f(A—W—v q ) > | f(A — W — v q — a 2 ) 
due to the Phase 2 termination conditions. Now we reduce the analysis to look 
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like the previous sub-case through the following, 

f(A - z) = f(A - W) + f(W\A - kb) - f(z\A - z) 

> f(A - W) + 0(0, a r )(OPT - f(A - W)) - f(z\A - z) 

> (1 - 0(0, ar))f(A -W) + 0(0, a r )OPT - f(A - W)/ 3 

> (l - §0(0, a r )) \f{A - W) + 0(0, a r )OPT 

> (l - §0(0, a r )) f (A — W — v q — a 2 ) + 0(0, a r )OPT 

Which since f(A — W — v q — a 2 ) > 0^0, p ^~ 1 ^OPT from ©, leads to the 
same ratio asymptotically as when z € A — W. The case q = 0 can be dealt 
with similarly by using f(A — W) > f(A — W — u p ) > §f(A — W — u p — ai) 

If p + q < 6, then let f(z\A — z) = rjf(A). Now we have, f(A — W) > 
f(A 0 ) = /(ai) + /(a 2 |ai) > 2f(z\A — z) = 2rjf(A), which further implies 
that f(A — W + w\) > 3r]f(A) since z £ W. Then proceeding as in Lemma 
[5] with k replaced by k — 1, S = A — W + w\ and hence s = 3 and finally 
l = k— \S\ = k— (p + q + 2 + l)<k — 8 gives us, 

f(A) > /3(3?7, )f(OPT(k - 1, N, 0)) > P(3 V , )OPT 
Then similar to the analysis in Theorem[G] f(A—z) > (1— 7 y)/ 3 ( 3 ? 7 , jr^)OPT > 

P(0, jEj)OPT. □ 

Recall that we wish to make the final set robust to removal of either one of 
the two elements {a±, a 2 }. In order to improve upon the guarantee of Algorithm 
[4l we need to devise a way to add new elements while paying attention to both 
ai and a 2 simultaneously. To this end, consider the following addition rule, 

argmax ^(5|A) = argmax [ min {f(S + A — ai), f(S + A — a 2 )} — 

|S|<m;SCAT—A |S|<m;SCiV-A 

min {/(A - ai), f(A - a 2 )}] 

, while ^ £ {ai,a 2 }, for suitable m > 1. The reason we need to resort to m- 
tuples instead of singletons is because, for m = 1 we cannot guarantee improve¬ 
ments at each iteration, as there need not be any element that has marginal 
value on both ai and a 2 . However, for larger m we can show improving guar¬ 
antees. More concretely, consider an instance where f(a{) = /(a 2 ) = 1, ai has 
a copy a[ and additionally both ai and a 2 have ‘partial’ copies, /(a^) = jr 
and /(a^|aj) = 0 for j £ {l,...,fc}, i £ { 1 , 2 }. Also, let there be a set G 
of k — 2 garbage elements with /(G) = 0. Finally, let /({ai,a 2 }) = 2 and 
f(al\X) = j; if {at, a'} fl A' = 0. Running the algorithm with: (i) m = 1 out¬ 
puts {ai,a 2 } U G in the worst case, with (ii) m = 2 outputs a {, ai 2 on step j 
of Phase 1 and thus, ‘partially’ copies both a\ and a 2 . Instead, if we run the 
algorithm with (iii) m = 3, the algorithm picks up a} and then copies a 2 al¬ 
most completely with { a ..., a 2 ~ 3 }- In fact, we will show that while {ai, a 2 } 
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are minimizers, adding m-tuples in this manner allows us to guarantee that 
at each step we increase g(A) by f times the difference from optimal. 
Thus, when m is large enough that —« 1, we effectively add value at the 
‘greedy’ rate of ^ times the difference from optimal. However, this is while 
z € {ai, 02 }, so we need to address the case when {ai, 02 } are not minimizers. 
One approach would be to follow along the lines of Algorithm [I] by adding 
singletons greedily w.r.t. / (similar to Phase 3) once the minimizer falls out 
of { 01 , 02 }. More concretely, 


Algorithm 5 A (1 — 1/e) — e Algorithm for r = 1 

input: m 

1: Initialize A = {ai,a, 2 } 

Phase 1: 

2: while |A| < k and z(A) C { 01 , 02 } do 
3: l = min{m, k — |A|} 

4: A —A U argmax g(S\A) 

\S\=l-,SCN-A 

Phase 2: 

5: while \A\ < k do A = A + argmax/(ai| A) 

6: Output:^ xgN-A 

Before analyzing the approximation guarantee of Algorithm [5J we first need 

to show that in Phase 1, at each step we greedily add an m-tuple with marginal 
value m 7 1 times the difference from optimal. We do this by showing a more 
general property below, 

Lemma 9 Given two monotone submodular functions, f \, fi on ground set 
N. If there exists a set S of size k, such that fi(S) > Vi, Vi, then for every m 
(2 < m <k), there exists a set ICS with size m, such that, 

777 — 1 

MX) > ——v u w. 

Proof First, we show that it suffices to prove this statement for two mod¬ 
ular functions ■ Note that we can reduce our ground set to S. Now, 

consider an arbitrary indexing of elements in S = {si,..., Sk} and let Sj = 
{si,..., Sj},Vj £ [fe]. Consider modular functions such that value of element 
Sj is hi(sj) := fi(sj\Sj-i). Note that hi(S) = fi(S) and additionally, by sub¬ 
modularity, for every set ICS, fi( x ) = Y)j :Sj ex M s j\ x D Sj- 1 ) > hi(X). 
Also, we can assume w.l.o.g. that hj(S) = 1, Vi and we will show that hj(X) > 

m ~ 1 Vi 
k > vt - 

We proceed by induction on m. For the base case of m = 2, we can pick 
elements £ S such that /ii(ej) > and we are done. Now assume that 
the property holds for m < p and we show it for m = p + 1. Consider a set 
Xq of size p, such that hi(X o) > . Such a set exists by assumption, and 

note that if for some i, say i = 1, hi(Xo) > ?, we are done, since we can 
add an element e G S to X 0 such that /i 2 (e + X 0 ) > ?. So we assume that 
J A L <h i {X 0 )<l,Vi. 

Now, consider the reduced ground set S — Xq. Then we have that hi(S — 
Aq) > 1 — Vi and since |5 — Aq| = k — p, we now have by assumption that 
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there exists a set A'i of size p such that hi(X i) > — ATo) > Using 

the same arguments as above, we can assume that hi(X i) < f. We repeat this 
until | S — U Xj | < p. Then we end up with a set X' with \X'\ = p' < p and 

hi{X' ) > , V* since hi (S) = 1, Vi. lip' = 0, we have a contradiction and hence 

for some Xj and i, hi(Xj) > ?, and we are done. Otherwise if p' > 1, using the 
induction assumption for m = p+l—p' on the ground set S'—A', we have a set 
Y with \Y\ =p+l-p', such that hi(X'UY) > hi{X') + ^(l-hi{X')) > f. 
□ 

Based on this lemma, consider the below generalized greedy algorithm, 


Algorithm 6 Generalized Greedy Algorithm 
input: m, V\ , V 2 
1: Initialize A = 0 
2: while \A\ < k do 
3: m = min{m, k — |A|} 

4: Find |x| fi(X\A) > m ~ 1 [Vj — fi(A)],X G N — A, \X\ = by enumeration 

5: A = A UX 

6: Output:A 

Theorem 10 If there exists a set S, l^l = k, such that fi(S) > Vi, then 
Algorithm^ finds a set A, \A\ = l, such that, 

(/3(0, 1 ) - /2(1 /mj)Vi, 

by making 0(n m+1 ) queries. 

Proof The analysis closely follows that of Lemma [1] in [241E5] . Let A, be the 
set at iteration j, then using Lemma [9] for monotone submodular functions 
/'(.) = fi(.\Aj) with V( = Vi — fi(Aj) we have that 3X, |A| < m, such that 
fi(X\Aj) > ^[Vi - fi(Aj)]. Hence, MAj+i) > fMj) + - fi(Aj)]. 

Now similar to what is shown in [M1I2S1, this gives us that after l iterations, 

£ (' - 0 - 1 ) - n ^) v - 

Note the problem of maximizing the minimum of two monotone submodular 
functions is a special case of the above and hence for Algorithm [5j we have 
that while in Phase 1, after l iterations, 

g{A) = min{/(A - ai),f(A - a 2 )} 

> (/3(0, - 12(l/m)) min{/ (OPT(k, N, 1) - a\), f(OPT{k, N, 1) - a 2 )} 

> (/ 3 ( 0 , l) - n(l/m))g(OPT(k, N, 1 )) ( 10 ) 


Theorem 11 Given m > 2, Algorithm [5] is /3(0, k 2 ™ 2 ) — 17(l/m) 
(1 — 1/e) — 17(1 /m) approximate, and makes 0(n m+1 ) queries. 
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Proof Let Aq = {ai, a 2 }. Consider the function g°(S) = min ie { lj2 }{/(<S' — a^)} 
and let z°(S ) be the g° minimizer of a set S. Note that if A 0 D z(S) ^ 0 
then g°(S) = g(S). Also, with the standard definition of marginal, note that 
for any set S fl A 0 =0, t; 0 (5'|X) > min, e { li2 }{/(>S'|X — as*)} > f(S\X). Let 
U be the set added during Phase 1 and similarly W during Phase 2. Also, 
let U = {iti,..., Up}, where each Ui is a set of size m and similarly W = 
{u>i, ... ,uv}, where each Wi is a singleton. Let OPT = g(OPT(k, N, 1)) and 
note that if r = 0 i.e., Phase 2 doesn’t occur, we have that 2 £ A 0 and using 
lUDt . g(A) > [3(0, — f2(l/m)]OPT. So assume r > 0 and also p > 2, 

since the case p = 1 will be easy to handle later. Now, let f(z\A — z) = rjf(A), 
and so g(A) = (1 — rf)f{A). Using Lemma [5j we will show that, 


f(A) > P(sg ,~—^ 


OPT 


( 11 ) 


Then similar to Theorem 0 we have, g{A) > [3(0, k ~ 2 ™~ 2 ) - 12(1 /m)\OPT. 
To show m, let at = z°(A 0 U U). Then, observe that, 

f(A) = g°(A 0 U (U - Up)) + g°(u p \A 0 U (U - u p )) 

+f( a i\(A 0 UU) — at) + f(W\Ao U U) (12) 


For the first term in m, we have 

g°(A 0 U (U - Up)) = g°(A 0 U m) + g°({u 2 ,u p -i}\A 0 U iti). 


Then, due to the greedy nature of Phase 1, we have using Theorem [TUI 

g°({u 2 ,..., Up-JIAo U Ul ) > >9(0, m ~ 1 mp ~ 2m ))OPT - g°(A 0 U Ul ){13) 

m k 

Similarly, for the last term in m, we have, 


f(W\A 0 U U)> /3(0, \){OPT - f(A 0 U U)) (14) 


Now we make some substitutions, let A = g 0 (AoUui) + g 0 (upjAoU(l7 — u p )) + 
f{a,i\(A 0 U U) — a p = a r = Thus, a p + a r = 1 - ~ 

1 — 1/m. Also, f(Ao U U) = A + g°({u 2 , ..., n p _i}|A 0 U u\). This gives us, 

f(A) = A + g°({u 2 ,..., u p -i}\A 0 U m) + f(W\A 0 U U) 


> A + g°({u 2 ,u p -i}\A 0 U Ul ) + > 9 ( 0 , a r )[OPT - f(A 0 U U)} 

> A + (1 - > 3 ( 0 , a r ))g°({u 2 ,u p -i}\A 0 U ui) + 3 ( 0 , a r )(OPT - A) 


>A + (l/e ar )3(0, a p )(OPT - <?°(Ao U m)) + 3(0, a r ){OPT - A) 
>A + (3(0, a r ) + 3(0, a p )/e^)[OPT - A] 

> A + (1 — 1/e — 12(1 /m))[OPT - A] 


(a) comes from da and ( b ) from (fl3l) . Next, we will show that for any x, 
3f(x\A — x) < A, which implies that A > 3r]f(A). This combined with Lemma 
El gives us (fill) . 
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Now, to show the desired lower bound on A, we will first show in two steps 
that for all x, 2f(x\A — x) < g°(A 0 U u\) = f(d 2 ) + g°(ui\A 0 ). 

Step 1: for all x, f(x\A — x) < /(a 2 ). To see this for x / ai, note that 
f(x\A — x) < f(x\A 0 — x) and further, f(x\A 0 — x) < f(d2\ai) < f(d2), where 
the first inequality is because 02 adds maximum marginal value to a ±. For 
x = ai, since Phase 1 ends, we have that f(ai\A — a\) < /(ai|a 2 + U) < 
f(y\(A 0 \JU) — y) for some y not in A 0 and then we have f(y\(A 0 U U) — y) < 
f(a 2 ) as before. 

Step 2: f(x\A — x) < g°(ui\A 0 ). For x qL Aq, we have that f(x\A — x) < 
f(x\A 0 ) < g°(x\A 0 ) < ( 7 °(iti|Ao). For x £ A 0 , from the fact that A 0 U U 
has a minimizer y fL Ao, we have that f(x\A — x) < f(x\(Ao U U) — x ) < 
f(y\(A 0 U U)-y) and further f(y\(A 0 U U) - y) < f(y\A 0 ) < g°(u i|A 0 ) as 
before. 

Finally, we show that f(x\A—x) < g°(up\AQ\j(U—Up))+f(ai\(A Q \JU)—ai), 
for all x. Let dj = z° (A 0 U (U — u p )) and observe that, 

g°(A 0 UU)- g°(A 0 U (17 - u p )) + f(ai\(A 0 U 17) - a,) 

= f((A 0 U 17) - - f((A 0 - dj) U (17 - u p )) + /(oi|(-4 0 U 17) - a*) 

= f(A 0 U 17) - f((A 0 - aj) U (17 - Up)) 

> f(aj\(A 0 - aj) U (U - Up)) 

Now, before adding we have that g°(.) = g(.) and in fact, dj is a minimizer 
of Aq U (17 — Up), so clearly, for all £ £ A 0 U (17 — u p ), the desired is true. 
For x G Up, it is true since f(u p \A 0 U (17 — u p )) < g°(u p \A 0 U (17 — u p )). For 
x G W, we have that f(x\A — x) < f{x\A 0 U(U — u p )) < g°(x\A 0 U(U — u p )) < 
g°(u p \A 0 U (17 — Up))), and we are done. 

To complete the proof, we need to show that 3f(z\A — z) < g°(A 0 U ui) + 
g°(u p \A 0 U (17 — u p )) + f(di\(A 0 LS U) — aj) implies (fTH . This follows from 
Lemma[5l with N replaced by N — d\, l = k— |Ao| — |ui| — \u p \ = k — 2 — 2 m 
and A as defined. 

The case p = 1 can be dealt with same as above, except that now A = 
g°(A 0 U Ml) + f(di\(A 0 Ulti) - di) + f(wi\A 0 U m) = f(A 0 U ui Uwi). □ 

Before discussing a similar result for constant r > 1, we first describe a fast 
0.387 approximation for r = o(Vk), which we will use when showing a (1 — 
1 /e) — e approximation for fixed r. 

3.2.2 0.387 Algorithm for r = o(\fk) in the possible absence of ‘‘copies” 

One can recast the first algorithm in Section 13.1.21 which greedily chooses 
{ai,..., cifc_ 2 r 2 } elements and adds r copies for each of the first 2 t elements, as 
greedily choosing 2 r elements, ignoring them and choosing another 2 r greedily 
(which will be copies of the first 2r) and repeating this r times in total, leading 
to a set which contains A 21 - and r — 1 copies of each element in A 2 T . Then, 
ignoring this set, we greedily add till we have k elements in total. Thus, the 
algorithm essentially uses the greedy algorithm as a sub-routine r -f 1 times. 
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Based on this idea, we now propose an algorithm for r = o(\fk), which can 
also be viewed as an extension of the 0.387 algorithm for r = 1. It achieves an 
asymptotic guarantee of 0.387 for r = o(Vk) and this degrades proportionally 
to 1 — T as r approaches J Let t' = r 2 logfc, then, 


Algorithm 7 Algorithm for r < 



1 

2 

3 

4 

5 

6 


Initialize Aq = A\ = 0. 
while |Ao| < t' do 

while |X| < t'/t do X = X + 

Aq = Aq U X 


argmax f(x\X) 
xeN-(A 0 ux) 


while \A±\ < k — r' do A\ = A\ + argmax f(x\A\) 
Output: A 0 U Ai xeiV-(A 0 uA 1 ) 


Theorem 12 Algorithm [7] has an asymptotic approximation ratio of !f e \ ~ 
0.387 for t = o(Vk). 

Proof Let A = Aq U A\ be the output with Aq, Ai as in the algorithm. Define 
Zq = Aq n Z and Z\ = Z — Zq = A\ D Z. Let OPT(k — r, N — Zq, 0) = Aq U A' 
where A r 0 = OPT(k — t,N — Zq , 0) D (Aq — Zq) and hence X C N — Aq. Now 
note that, 

f(A 0 - Zq) + f(OPT(k — r,N — Aq, 0)) > f(A' 0 ) + f(X) 

> f(OPT(k — t,N — Z 0 , 0)) 

> g(OPT(k, N, t)) [y Lemma[2] 

Which implies, 

f(Ao - Zq) > g(OPT(k, N, r)) - f(OPT(k - t, N - A 0 ,0)) (15) 


In addition, 

/(Tli) > p(0, f(OPT(k — t,N — Ao, 0)) (16) 

Next, index disjoint subsets of Ao based on the loop during which they were 
added. So the subset added during loop i is denoted by Aq, where i € {1,..., r}. 
So the last subset consisting of r logfc elements is Aq, . 

Now, if Zq includes at least one element from each Aq then Z\ = 0 and for 
this case we have from m and m above, 


f(A-Z) >max{/(A 0 -Z 0 ),/(A 1 )} 

> ma x{g(OPT(k,N,r)) - f(OPT(k - t, N - A o ,0)), 

j8(0, ^ )f(OPT(k — t, N — A 0 ,0))} 


> 


1 + 

k—too 




g(OPT(k, N,t)) 


g(°PT(k,N,T)) 
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Next, suppose that \Zi\ = m > 0, then there is some A : 0 for such that A : 0 nZ = 
0. Further let f(Zi\A— Z) = r/f(Ai), then since > |Zi|logfc, similar to 
© we have due to greedy iterations, f(Mf) > (log k)f(Zi\Ai. U A\ — ZA > 
{\ogk)f(Zi\A - Z) = ? 7 (log k)f(Ai) and also, 

/(A - Z) = [/(A,) + f(A 0 \Ai)} - [f{Zy\A -Z) + f(Z 0 \A - Z 0 )] 

> /(Ai) - f{Z x \A -Z) + /(A 0 |A - A 0 ) - f(Z 0 \A - Z 0 ) 
>(1-t ? )/(A 1 ) (17) 


Moreover, let A! x be the set of first r elements of A\. Then, due to greedy 
iterations we have /(A^) > f(Zi\A\ — Z\) > rjf(Ai). Thus, from Lemma [5j 
with N replaced by N — A 0 , k replaced by k — r, S = A\ with s = r) and 
l = k — |<S| = k — t' — t, we have, 

/(Ar) > P(r,, k ~ k T '_~ T )f(OPT(k -r,N- A 0 , 0)) (18) 

From (fl5ll . (H71) and (fl8l) . 


/(A - Z) > max{/(A 0 - Z 0 ), (1 - t 7 )/(A 1 ), /(Aj)} 

> max{/(A 0 - Z 0 ), (1 - ? 7 )/(Ai),? ? (logfc)/(Ai)} 

(a) 

> ma x{g(OPT(k, N, r)) — f{OPT(k — t,N — Aq, 0)), 


logfc 


-A-- 


i 


k — t' — T 


> 


1 + log k 1 + log k k — t 

log k q ^ 1 k—r' — T 


)f{OPT(k — t,N — A 0l 0))} 


1+logfc^ V 1+logfc’ k — r 


1 + 

k—yoo 
-> 


log k 
1+log k 

e — 1 


^(l+logfc’ 


k—r'—r 




g{OPT(k,N, T )) 


g(OPT(k, N,t)) 


2 e — 1 ' 

where (a) follows by substituting 77 = l+ ] og k ■ 


□ 


3.2.3 (1 — 1/e) — e algorithm for constant r 

As we saw in Section 13.2.11 the analysis of Algorithm [5] is fairly technical 
and additionally, the phase wise approach doesn’t generalize well to r > 1 
since we can have a minimizer that intersects with the initial set but is not 
a subset of the initial set, unlike for r = 1 where a minimizer z is either in 
{ai,a2} or not. Then for t = 1, an alternative approach comes from realiz¬ 
ing that we would like to build a set that simultaneously has large value on 
both /i(.) = /(.|ai) and / 2 O) = /(.|a 2 ). Also, we can capture the notion of 
continuing greedily w.r.t. /(.) once the set becomes robust to removal of ei¬ 
ther of or 02 , by considering a third function /a(.) = /(.|{ai, 02 }). Thus, 
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we can think about a multi-objective problem over the monotone submodu¬ 
lar functions fi,f 2 ,fa, and try to add a set A\ to Aq = { 0 , 1 , 02 }, such that 
fi{A\) > (1 — l/e)fi(OPT(k,N, 1)),V* since that, for instance, gives us, 

f{Ai + A 0 — 02 ) = fi(Ai) + /(oi) 

> /(or) + (1 - 1/ e)(f(OPT(k, N , 1)) - /( ffll )) 

> (1 — l/e)g(OPT(k, N, 1)) 

and similarly for f 2 , fz- Generalizing this for larger r, we start with the set 
Aq obtained by running Algorithm [7] with r' = 3r , implying |Ao| = 3r 2 . 
Now, consider the monotone submodular functions /$(.) = f(.\Yi) for every 
possible subset (|1}| > 3r 2 — r) of A 0 and denote the set of functions by 
L (\L\ = 2 n( - T \ large but constant). We would like to find a set A of size 
k — 3t 2 such that fi(X) > (1 — 0(^))[g(OPT(k, N,t)) — /(T))]. We know 
that OPT(k,N , 1) is a set such that, fi(OPT(k, N,t)) > f(OPT(k,N,r)) — 
f(Yi) > g(OPT(k,N,r)) — /(A)), however it is a set of size k. So we need 
the proof of existence of a set S of size k — 3r 2 such that, fi(S) > (1 — 
e$))fi(OPT(k,N,l)),Vi. 

Lemma 13 Given a constant number of monotone submodular functions L = 
{fi,. ■■,/;} and values V = {V),...,Vj} and a set S' of size k, such that 
fi(S') > Vi,Vi. There exists a set S of size k—p such that, 

MS) a 0 - k- {r l + i-i ) Y V <=(‘ - e(1 ' k ^ v ‘ 


for constant l,p <C fc. 

Proof This is clearly true for l = 1 since 3 S C S', |S| = k — p, such that, 
f(S) > ^Y^f(S'). More generally, we show that 3<Si C S, |5i| = k — 1, such 
that fi(Si) > jfrff(S'). Then we can reapply this on Si to get S 2 C Si, IS 2 I = 
k — 2 and fi(S 2 ) > jr^fErf(Si). Repeating this p times over all, we get 
S p C S, |S P | = k—p and fi{S p ) > /Zj~ q T ^i, which gives the desired. So, 

it remains to show that we can remove an element S' — Si from S', such that 

MS'- Si\Si) < -ihMS'). 

Let S' = {s},...,4} and let S' = {s},..., s'}. Now, as we argued in 
Lemma [9] it suffices to show this for modular functions hi where /li(s' ) = 
fi{ s 'j\S'j_i). W.l.o.g., assume that S' is indexed such that /ii(s') < /ii(s' +1 ). 
Consider the set S 1 { of the 1 + \^-f-k~\ smallest elements w.r.t. hi, then 

3jo < 1 + suc h that s' o is one of the smallest elements for 

each function hi,i > 2. This is true since there are at most (k — &;])(£ — 

1) < distinct elements which are in the top f° r some function 

hi , i > 2. Now removing Sj 0 decreases the value of any hi by at most a factor 
It! ~ 1 k—2l 

l?t - k ~ i ■ 
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Now that we know that set S of size k — 3r 2 satisfying the desired inequalities 
exists, we need an algorithm to find a set that (1 — l/e)-approximates the 
value of S for each While Algorithm [G] only works for up to two functions, 
as mentioned in the beginning, a randomized (1 — 1/e — e) algorithm for a 
more general problem was given in [3] and we use it here. 

Lemma 14 (Chekuri, Vondrak, Zenklusen [9]) Given constant number 
of monotone submodular functions L = {/*,..., //} and values V = { V ,..., Vi}. 
If there exists a set S (jSj < k) with fi(S) > Vi, Vi, there is a polynomial time 
algorithm which finds a set X of size < k such that fi(X) > (1 — 1/e — e)Vi,Vi, 
with constant probability, for a constant e < j^rj, making OinfV ) queries. If 
a set S with the value lower bounds given by V doesn’t exist, then the algorithm 
gives a certificate of non-existence. 

Denote the algorithm by A and the let set output by this algorithm be A(V, L ), 
for inputs V, L as described above. To ease notation, from here on we generally 
ignore the e term. 

A final hurdle in using the above is that we need to input the values 
Vi = g(OPT(k,N,r)) — f(Si) and hence, we need an estimate of OPT = 
g(OPT(k, N,t)). We can overestimate OPT as long as the problem remains 
feasible. However, underestimating OPT results in a loss in guarantee. Using 
Algorithm [3 we can quickly find lower and upper bounds lb,ub such that 
lb < OPT < ub = lb/0.387. Now we do binary search, starting with V = 
{ub— If we find a set A(V,L), we are done, else we find that such a 

set doesn’t exist and we next try with V = { ub + lb — and so on until 

we find an estimate OPT' > (1 — S)OPT in 1/log(l + d) iterations. 

Thus, our scheme uses the algorithm for multi-objective optimization as a 
subroutine to find the estimate OPT' > (1 — 5)OPT and simultaneously a set 
Ai = A({OPT' — f{Si)}i, L) of size k — 3r 2 , such that fi(Ai) > (1 — l/e)(l — 
0(l/k))(OPT' - f{Si)). 

The final output is A = AqUAi and we next show that this is asymptotically 
(1 — 1/e — e) approximate. 

Proof We ignore the e and 0(1/k) terms to ease notation. First, note that if 
Z C Ao, using f(Ai\A 0 — Z) > (1 — 1 /e)(OPT' — f(A 0 — Z)), we have that 
f(Ai +A 0 -Z)= g(A ) > (1 - 1 /e)OPT'. 

If Z ^ Ao, then let Z± = Z (~l A± and Zq = Z — Z\. Similar to the proof 
of Theorem [13 we have due to greedy additions f(A 0 — Zq) > 3f(Zi\A — Z) 
(in contrast with (log k)f(Z\\A — Z)). Now, since f(.\A 0 — Zq) is one of the 
functions in L , we have that f(A — Zq) > f(A 0 — Zq) + (1 — 1 /e)(OPT' — 
f(A 0 — Zo)). Now with f(Zi\A — Z 0 ) = tjf(A — Z 0 ), this becomes f(A— Z 0 ) > 
f f(A - Zq) + (1 - 1/ e)OPT', giving us 

f(A - Zq) > /3(3 V , 1 )OPT'. 

Now, similar to Lemma [G] we have g(A) = f(A — Z) > (1 — r])f(A — Zq) and 
hence g(A) > /3(0, l)OPT'. □ 
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Finally, consider the following generalization of Lemma 1101 

Conjecture 15 Given l > 1 (treated as a constant) monotone submodular 
functions f \,..., fi on ground set N, a set S C N of size k, such that fi(S) > 
Vi for all i £ [I] and an arbitrary m with s > m > l, there exists a set X C S 
of size m, such that, 

fi(X)> TO ~ fc 0(1 V i, Vie[Z] 

If true, the above would give a greedy deterministic (1 — 1/e) — fi(l/m) ap¬ 
proximation for the multi-objective optimization problem and thus also for 
our robust problem, for fixed r. 

4 Extension to general constraints 

So far, we have looked at a robust formulation of PI, where we have a car¬ 
dinality constraint. However, there are more sophisticated applications where 
we find instances of budget or even matroid constraints. In particular, consider 
the generalization max min f(A\B), for some independence system X. By 

definition, for any feasible set A £ X, all subsets of the form A\B are feasible as 
well, so the formulation is sensible. Let’s briefly discuss the case of r = 1 and 
suppose that we are given an a approximation algorithm A, with query/run 
time O(R) for the r = 0 case. Let Go denote its output and zo be a minimizer 
of Go- Consider the restricted system X ZQ = {A : z 0 £ A, A £ X}. Now, in 
order to be able to pick elements that have small marginal on zq but large 
value otherwise, we can generalize the notion of ignoring zq by maximizing 
the monotone submodular function f(.\zo) subject to the independence sys¬ 
tem I Zo . However, unlike the cardinality constraint case, where this algorithm 
gives a guarantee of 0.387, the algorithm can be arbitrarily bad in general (be¬ 
cause of severely restricted I Zo , for instance). We tackle this issue by adopting 
an enumerative procedure. 

Let Aj denote the algorithm for r = j and let Aj(N, Z ) denote the output 
of Aj on ground set N and subject to restricted system Xz- Finally, let z(A) = 

argmax/(x). With this, we have for general constraints: 
i6 A 


Algorithm 8 A T : yqq for General Constraints 

1: Initialize i = 0, Z = 0 
2: while N — Z ^ 0 do 
3: Gi = Ao(N - Z,H) 

4: Zi£z(Gi); Z = Z U Zi 

5: Mi = Zi U At— i (JV — Z, Zi)-, i = i+ 1 

6: Output: argmax{g r (S)\S £ {Gj} l j=0 U {MjY j=0 } 


To understand the basic idea behind the algorithm, assume that Zq is in 
an optimal solution for the given r. Then, given the algorithm A r - i, if a 
minimizer of the set Mq = zq U A t -i(N — zq, zo) includes z o, it only removes 
r—1 elements from A T -\(N— zq, zq). On the other hand, if a minimizer doesn’t 
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include zq, g T (M 0 ) > f(zo) > P m A~ 9 t(m 0 ) . These two cases yield the desired 
ratio, however, since zo need not be in an optimal solution, we systematically 
enumerate. 

Theorem 16 Given an a approximation algorithm A for t = 0 with query 
time O(R), algorithm A T described above guarantees ratio for general r 
with query time 0(n T R + n T+1 ) 

Proof We proceed via induction. Clearly, for i = 0, Ao = A, and the statement 
holds. Assume true for i G {0,1,..., r — 1}, then we show validity of the claim 
for A t ■ The query time claim follows easily since the while loop runs for at most 
n iterations and each iteration makes 0(n T + n T ~ 1 R) queries (by assumption 
on query time of A t -i) and updating the best solution at the end of each 
iteration (counts towards the final output step) takes 0(n T ) time to find the 
minimizer by brute force for two sets Gi and Mi. 

Now, let OPT (I , N, r) denote an optimal solution to max^gj min|B| =T f{A— 
B) on ground set N and assume that zo G OPT(X, N,t). For any minimizer 
B of A, we have for every element z G z(A ), f(z) > > A b J a ) _ L e ^ 

denote a minimizer of Gq. Hence, if zq fL Zo, we have that g T (Go) > f(zo) > 
and additionally g T {Go) > /(Go) — /(Zo), giving us the desired since 
g T (Go) > max{^^, /(Go) — /(Zo)} > . If zq is in the minimizer of Go 

and if /(G 0 - Z 0 ) < then we have that /(Z 0 |G 0 - Z 0 ) > y^y/(G 0 ), 

implying that f(zo) > ^+ 1 ^ • N° w , let Zq denote the minimizer of M 0 and 
note that if zo fL Z' Q , we are done. Else, we have that, 

gr(M 0 ) = g r -i(M 0 - z 0 ) > -g r -i(OPT(l Zo ,N - z 0 ,t- 1)) 

T 

> —g T - 1 (OPT(l, N,t) — z 0 ) 

T 

> -g T (OPT(l,N } r)) > -2-g T (OPT(X, N,t)) 

T T + 1 

Where the first inequality stems from the induction assumption, the second 
and third by our assumption on zo- This was all true under the assump¬ 
tion that z o G OPT (I, N, r), if that is not the case, we remove zq from the 
ground set and repeat the same process. The algorithm takes the best set out 
of all the ones generated, and hence there exists some iteration l such that 
Zi G OPT(X,N,t) and analyzing that iteration as we did above, gives us the 
desired. 

Finally, for the cardinality constraint case, we can avoid enumeration al¬ 
together and the simplified algorithm has runtime polynomial in (n, t) and 
guarantee that scales as -, which for \[Xk < r = o(/c), is a better guarantee 
than the naive one of -A— from Section [PI □ 


5 Conclusion and Open Problems 

We looked at a robust version of the classical monotone submodular function 
maximization problem, where we want sets that are robust to the removal of 
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any r elements. We introduced the special, yet insightful case of copies, for 
which we gave a fast and asymptotically (1 — 1/e) approximate algorithm for 
r = o(k). 

For the general case, where we may not have copies, we gave a deter¬ 
ministic asymptotically (1 — 1/e — f2(l/m)) algorithm for r = 1, with the 
runtime scaling as n m+1 . As a byproduct, we also developed a determinis¬ 
tic (1 — 1/e) — approximation algorithm for bi-objective monotone 

submodular maximization, subject to cardinality constraint. For larger but 
constant r, we gave a randomized (1 — 1 /e) — e approximation and conjectured 
that this could be made deterministic. Additionally, we also gave a fast and 
practical 0.387 algorithm for r = o(y/k). Note that here, unlike in the special 
case of copies, we could not tune the algorithm to work for larger r and it is still 
open whether there is a constant factor approximation for r = o(fc), when we 
may not have copies. The best ratio we have in this regime is (1 — l/e)/(r+1), 
from the algorithm in the previous section. 

Finally, similar robustness versions can be considered for maximization 
subject to independence system constraints and we gave an enumerative black 
box approach that leads to an approximation algorithm with query time 
scaling as n T+1 , given an a approximation algorithm for the non-robust case. 

Acknowledgement 

Rajan Udwani would like to thank Jan Vondrak for a useful discussion and 
pointing out a relevant result in [9]. 

References 

1. A. Badanidiyuru and J. Vondrak. Fast algorithms for maximizing submodular functions. 
In SODA ’14, pages 1497-1514. SIAM, 2014. 

2. A. Ben-Tal, L. El Ghaoui, and A. Nemirovski. Robust optimization. Princeton University 
Press, 2009. 

3. D. Bertsimas, D. Brown, and C. Caramanis. Theory and applications of robust opti¬ 
mization. SIAM review, 53(3):464-501, 2011. 

4. D. Bertsimas and M. Sim. Robust discrete optimization and network flows. Mathemat¬ 
ical programming , 98(1-3):49-71, 2003. 

5. D. Bertsimas and M. Sim. The price of robustness. Operations research, 52(1):35—53, 
2004. 

6. N. Buchbinder and M. Feldman. Deterministic algorithms for submodular maximization 
problems. CoRR, abs/1508.02157, 2015. 

7. N. Buchbinder, M. Feldman, J.S. Naor, and R. Schwartz. A tight linear time (1/2)- 
approximation for unconstrained submodular maximization. FOCS ’12, pages 649-658, 
2012 . 

8. G. Calinescu, C. Chekuri, M. Pal, and J. Vondrak. Maximizing a monotone submodular 
function subject to a matroid constraint. SIAM Journal on Computing, 40(6): 1740- 
1766, 2011. 

9. C. Chekuri and R. Vondrak, J. a Zenklusendn. Dependent randomized rounding via 
exchange properties of combinatorial structures. In FOCS 10, pages 575-584. IEEE, 
2010 . 

10. S. Dobzinski and J. Vondrak. From query complexity to computational complexity. In 
STOC ’12, pages 1107-1116. ACM, 2012. 

11. U. Feige. A threshold of In n for approximating set cover. Journal of the ACM (JACM), 
45(4): 634-652, 1998. 



28 


James B. Orlin et al. 


12. U. Feige, V.S. Mirrokni, and J. Vondrak. Maximizing non-monotone submodular func¬ 
tions. SIAM Journal on Computing, 40(4):1133-1153, 2011. 

13. M. Feldman, J.S. Naor, and R. Schwartz. A unified continuous greedy algorithm for 
submodular maximization. In FOCS ’ll, pages 570-579. IEEE. 

14. M. Feldman, J.S. Naor, and R. Schwartz. Nonmonotone submodular maximization via 
a structural continuous greedy algorithm. In Automata, Languages and Programming, 
pages 342-353. Springer, 2011. 

15. S.O. Gharan and J. Vondrak. Submodular maximization by simulated annealing. In 
SODA ’ll , pages 1098-1116. SIAM. 

16. A. Globerson and S. Roweis. Nightmare at test time: robust learning by feature deletion. 
In Proceedings of the 23rd international conference on Machine learning , pages 353-360. 
ACM, 2006. 

17. D. Golovin and A. Krause. Adaptive submodularity: Theory and applications in active 
learning and stochastic optimization. J. Artificial Intelligence Research, 2011. 

18. C. Guestrin, A. Krause, and A.P. Singh. Near-optimal sensor placements in gaussian 
processes. In Proceedings of the 22nd international conference on Machine learning, 
pages 265-272. ACM, 2005. 

19. S. Iwata, L. Fleischer, and S. Fujishige. A combinatorial strongly polynomial algorithm 
for minimizing submodular functions. Journal of the ACM (JACM), 48(4):761-777, 
2001 . 

20. A. Krause, C. Guestrin, A. Gupta, and J. Kleinberg. Near-optimal sensor placements: 
Maximizing information while minimizing communication cost. In Proceedings of the 
5th international conference on Information processing in sensor networks, pages 2-10. 
ACM, 2006. 

21. A. Krause, H B. McMahan, C. Guestrin, and A. Gupta. Robust submodular observation 
selection. Journal of Machine Learning Research, 9:2761-2801, 2008. 

22. J. Leskovec, A. Krause, C. Guestrin, C. Faloutsos, J. VanBriesen, and N. Glance. Cost- 
effective outbreak detection in networks. In Proceedings of the 13th ACM SIGKDD in¬ 
ternational conference on Knowledge discovery and data mining, pages 420-429. ACM, 
2007. 

23. Y. Liu, K. Wei, K. Kirchhoff, Y. Song, and J. Bilmes. Submodular feature selection 
for high-dimensional acoustic score spaces. In Acoustics, Speech and Signal Processing 
(ICASSP), 2013 IEEE International Conference on, pages 7184-7188. IEEE, 2013. 

24. G.L. Nemhauser and L.A. Wolsey. Best algorithms for approximating the maximum of 
a submodular set function. Mathematics of operations research, 3(3): 177-188, 1978. 

25. G.L. Nemhauser, L.A. Wolsey, and M.L. Fisher. An analysis of approximations for 
maximizing submodular set functions—i. Mathematical Programming, 14(l):265-294, 
1978. 

26. A. Schrijver. A combinatorial algorithm minimizing submodular functions in strongly 
polynomial time. Journal of Combinatorial Theory, Series B, 80(2):346-355, 2000. 

27. M. Sviridenko. A note on maximizing a submodular set function subject to a knapsack 
constraint. Operations Research Letters, 32(l):41-43, 2004. 

28. M. Thoma, H. Cheng, A. Gretton, J. Han, HP. Kriegel, A.J. Smola, L. Song, S.Y. Philip, 
X. Yan, and K.M. Borgwardt. Near-optimal supervised feature selection among frequent 
subgraphs. In SDM, pages 1076-1087. SIAM, 2009. 

29. J. Vondrak. Optimal approximation for the submodular welfare problem in the value 
oracle model. In STOC ’08, pages 67-74. ACM. 

30. J. Vondrak. Symmetry and approximability of submodular maximization problems. 
SIAM Journal on Computing, 42(l):265-304, 2013. 

31. J. Vondrak, C. Chekuri, and R. Zenklusen. Submodular function maximization via the 
multilinear relaxation and contention resolution schemes. In STOC ’ll, pages 783-792. 
ACM, 2011. 




